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Electronic data processing circuit that transmits packed words via a bus 



The invention relates to an electronic data processing circuit that conqirises a 
bus and a plurality of data handling units that have access to the bus. 

An address/data bus is a well-known solution for allowing multiple data 
handling units to get access to shared resources, such as memories. Conventionally; one data 
S liflnHling unit at a time gets access to the bus, to place data and a corresponding address on 
the data lines and address lines of the bus respectively. 

Modem data busses are vecy wide, pennitting words with many bits (e.g. 64 or 
128 bits) to be placed on the bus in a single bus cycle. Data handling circuits do not always 
use all these bits, because aSssa the size of a word that has to be written is less than the 
10 tnfiviiTium word size. For example, often numbers of 32 or even 16 bits are used: 

US patent No. 6,366,984 discloses how this redundancy can be used to 
increase memory bandwidth by packing different words that have to be written fix)m a cache 
memory to a main memory at adjacent addresses. The packed words are placed on the data 
lines of the data bus in parallel in the same bus cycle. The circuit of US patent No. 6,366,984 
1 S waits b efbre writing back iq^ted data words fiom the cache memory if the data words span 
less than the fiill bus widtfi. The circuit compares tbe addresses to determine whether the 
addresses of the data words are adjacent If so the data words are packed and written in a 
single bus cycle. Similarly EP 465320 discloses a write packer (see e.g. fig 19 element 301) 
that collects write requests and compares the addresses from the requests to determine 
20 whether data from different requests can be packed into a single bus cycle. 

Of course this form of packing is limited by tixe maximum word size (the 
nuihb^ of data lines) of the bus. Writing to a range of addresses that spans more than this 
word size generally involves multiple bus cycles, but in this case the supply of multiple 
addresses can be avoided by using a start address and a Iragth code, that enables a data 
25 receiving device, such as a memory to compute the relevant addresses for the data firom 
different bus cycles internally. 

These documents show how one can reduce the number of bus cycles that is 
needed to write multiple data word of less than maximum length. This leaves more bus cycles 
for use for other data transfers, which results in a reduction in the number of potential bus 
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conflicts and an overall increase in speed It will be clear that the reduction of the number of 
cycles depends on the adjacency of addresses used in write actions. When the addresses are 
unrelated, no reduction in the number of bus cycles is possible. In this case these publications 
give no reason for packing words into a larger wprd. 
5 Apart firom causing potential access conflicts that reduce execution speed, 

address/data busses also cause considerable power consumption. Bus data lines and address 
lines have to extend over considerable distances because tiiey connect to different units of the 
circuit In an integrated circuit the bus lines usually extend over most of the chip size. Thus, 
bus lines are generally much longer than internal lines in the units of the circuit The great 
10 length of the bus lines means that strong driver circuits are needed, for example to charge the 
capacitance associated with the bus lines. 

Among others it is an object of the inventicm to reduce the power consumption 
involved in passing read and/or write data via a data bus. 

Among others it is a fiirther object of the invention to reduce power 
15 consumption by reducing the number of bus cycles in which new data has to be placed on the 
data lines of a bus. 

Among otiiers it is an object of the invention to increase the useful available 
bandwidth for passing data over a bus in an electronic circuit 

An electronic circuit according to a first aspect of the invention is set fortii in 

20 Claim 1. According to this aspect data words fix)m different write or read requests 

togetfa^ on the data lines of the bus in the same cycle if the size of these data words is 
smaller than the mMtmnni word size supported by the bus. According to this aspect write 
addresses associated with write data are placed on the bus in a plurality of different cycles, so 
that a data receiving circuit or data receiving circuits can use different bus cycles to obtain 

25 the different addresses associated with the different data words that have been placed on the 
bus in parallel. 

Typically, one or more of the data handling units that write the data are able to 
write data words of differentsizes. SimilarLy,_memory units may return read data of different 
sizes. These units are able to signal the size of the words. Dependent on the signalled sizes 
30 more or fewer data words are combined in a single cycle. Thus, if one imit produces a word 
with a size (e.g. 64 bits) that fiUs the whole data bus, a single address will be output before 
new data is placed on the data bus. If two data handling units write half size words (e.g. 32 
bits each), the two words are placed on the data lines together in a single bus cycle and two 
bus cycles with addresses for these words are used (one of these bus cycles may coincide 
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with the bus cycle in which the data words were placed on the bus). Similarly, if four data 
handling units write quarter size words (e.g. 16 bits each), the four words are placed on the 
data lines to^th^ in a single bus cycle and four bus cycles wiHi addresses for these words 
are used etc. 

5 In an embodimmt the addresses are placed on the address lines in a sequence 

that has a predetennined relation with the position of the corresponding words. Thus, for 
exan^le the write address fi>r die quarter word at a first p osition on the data lines is ovOpxA in 
one bus cycle, followed in the next bus cycle by the write address for the quarter word at a 
second position on the data lines etc. In another embodiment a signal line may be provided to 

10 signal to "vdiich position an address corresponds. 

According to another aspect of the invention the bus controller selects the 
position of a word on the data lines and/or a bus cycle in which a word is ou^ut so that the 
number of logic level changes that is needed to replace the previous data on the data lines 
with that word is niininiized. Thus, i>ower consumption is minimized. In case of four quarter 

1 S sized words that are placed on the data lines in one bus cycle, for example, twenty four 

position sequences of words are possible. The bus controller preferably selects the position 
sequence according to the number of bits that have to diange logic level. Preferably, tiie bus 
controller selects the position sequence that requires the absolute minimum of changes (i.e. 
among all twenty four possibilities with four quarter words), but without deviating from the 

20 invention the bus controller may select from fewer possibilities, or merely according to a 
criterion that ensures that the selected position sequence involves less level changes than 
another possible sequence. Thus, power consumption is reduced although the absolute 
mi'ninnnn may not be realized. Similarly, the bus controller may select the tenq)oral sequence 
in which words are placed on the data lines in successive bus cycles so as to minimiz e the 

25 required number of transitions. 

In another embodiment, part of the different bus cycles that are used for 
supplying the addresses of combined words is used to siqiply data which requires no address, 
for example read data or data from a data block for which first a starting address was 
supplied, so that the data receiving circuit can coxi:q)ute successive addresses internally. This 

30 reduces the number of bus cycles that is needed to siq)ply both data and addresses during 
execution of a given application program and thereby the power supply consumption 
associated with the program. 

In another embodiment, read data words may be packed on the bus with write 
data words. A read data word is produced in response to a previous read address. Therefore in 
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this embodiment successive addresses do not have to be suppUed for all data words that are 
packed together. This may be used to reduce the number of bus cycles for supplying 
addresses, for example to be able to place new data on the bus sooner, or to supply other 
addresses, such as addresses for later read operations. 

These and other objects and advantageous aspects of the invention will be 
described in a non-limitative way using the following figures 
Figure 1 shows an electronic drcuit 
Figure 2 shows a part of a bus inter&ce 
Figure 3 shows a further part of a bus interface 
Figure 4 shows apart of a bus connection 
Figure 5a shows a memory 
Figure 5b shows a further memory 
Figure 6 shows a further electronic circuit 
Figure 6a shows a multiplexer/driver 
Figure 7 shows a further electronic circuit 

Figure 1 shows an electronic circuit, containing a plurality of processors 10a- 
d, a bus int^rfiice 12,abus 14 and a plurality of memories 16a,b. Each processor lOa-d has an 
address output A, a data ou^ut D and a control input^output. Bus inter&ce 12 couples the 
address and data outputs to bus 14, to which memories 1 6a,b are coiqiled to receive address 
and data information. Bus 14 comprises a plurality of address lines, a plurality of n data Unes 
(eg. n— 64 or if=12o data lines) and control lines. 

For the sake of illustration processors lOa-d are shown to have data outputs 
only, but it should be understood that they may have data inputs, or a data input^output 
coupled to bus inter&ce 12. Although processors lOa-d are shown, it should be understood 
that any other kind of data handling circuit may be used. Similarly, although two memories 
are shown connected to bus 14 for the sake of illustration, it will be understood that many 
other circuits may be coimected to bus 14, not necessarily all memories, or that only a single 
circuit may be coimected to bus 14. 

In operation processors lOa-d produce data and write this data to locations in 
memories 16a,b. For this purpose, processors lOa-d generate write requests as part of which 
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they oulput addresses and data to bus interfece 12. Bus interfece 12 operates in bus cycles. In 
each bus cycle the bus intec&ce passes data bits to the n data lines of bus 14 in parallel. 
Similarly, the bus intet&ce passes one address in eadi bus cycle. Memories 16a,b receive the 
addre^es fiom bus 14 and use them to select locations whece the data is written. If necessary, 
iabec&ce 12 arbitrates between conflicting write requests, to sequence data and addresses 

fix)m different processors lOa-d. 

Dependent on flie type of data words that have to be written, a greater or lesser 

number of bits has to be passed from processors lOa-d to bus 14. On a control ou^ut 
processors lOa-d output a code to indicate the number of relevant data bits in the word, e.g. 8 
bits, 16 bits, 32 bits or 64 bits. Bus interface 12 tests these codes and, if possible, bus 
interface 12 packs multiple data words from different ones of proc^sors lOa-d. That is, bus 
interim ou^ruts data words from different processors lOa-d in parallel on the n data lines of 
bus 14 in the same bus cycle. For example, two o/l long data words may be oo^ut in 
parallel, or four n/4 long words. The corresponding addresses of the different words are 
output in successive bus cycles. This is illustrated in table I. 



Table I example of bus occiqmtion 



Bus Cycle No 


Address 


Data 


1 


Al 


DalDa2DdlDd2 


2 


A4 




3 


Al' 


Dal'Dbl'Dcl'Ddl' 


4 


A2' 




5 


A3' 




6 


A4' 




7 


A2" 


Dbl"Db2"Db3"Db4" 


8 







Table I shows a number of successive bus cycles (numbered 1-8), the 
addresses output on bus 14 for these cycles and the data output on the data lines of bus 14 in 
these QTcles. The data output fiom a first processor 1 Oa for use in one bus cycle is assumed to 
be composed of four parts labeUedDal, Da2, Da3, Da4 (each part e.g. 16 bits long). 
Similarly the ou^ data firan the second, third and fourth processor lOb-d for use in one bus 
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cycle are assumed to be composed of four parts each, labelled Dbl, Db2, Db3, Db4 and Del, 
Dc2, Dc3, Dc4 and Ddl, Dd2, Dd3, Dd4 respectively. 

It is assumed that initially the first and fourth processor 10a,d indicate that 
they output half size data words (n/2 bits), next four processors lOa-d indicate lhat they 
output quarter sized data words (n/4 bits) and next second processor 10b indicates that it 
ou^uts a full sized data word (n bits). 

In the bus cycle numbered 1 bus inter&ce 12 places the data bits from the two 
half size data words from the first and fourth processors 10a,d on the n data lines of bus 14 in 
parallel. This is indicated by the parts Dal, Da2, Ddl and Dd2 in the entry for bus cycle 
number 1. Bus inter&ce ou^uts the addresses Al and A4 for these data words &om the first 
and fourth processor 10a, lOd in successive bus cycles 1, 2. (Although it is shown that one 
address is ou^ut in the same bus cycle as the data it will be understood that, due to pipe- 
lining, other relations b^ween address and data timing may pertain). 

In the bus cycle numbered 3 bus intec&ce 12 place the data bits from the four 
quarter size data words &om the processors lOa-d on the n data lines of bus 14 in parallel. 
This is indicated by the parts Dal', Dbr, Del' and Ddl' in flie entry for bus cycle number 3. 
Bus inter&ce outputs the addresses Al', A2', A3' and A4' for these data words from the 
processors lOa-d in successive bus cycles 3-6. 

In the bus cycle numbered 7 bus interface 1 2 place the data bits fix>m the data 
word fix)m second processors lOb on the n data lines of bus 14 in parallel. This is indicated 
by the parts Dbl", Db2", Db3'' and Db4"in the entry for bus cycle number 7. Bus interfece 
outputs the address A2" for this data word fix>m second processor lOb in bus cycle 7. 

Although fiill occupation of the data lines has been shown for each bus cycle 
in "f^ch new data was placed on the bus, it will be understood that, if processors lOa-d 
ou^ut iztSufScient data, the n data lines need not ail be used: e.g. three n/4 bit data words 
may be placed on the data lines of bus 14 in parallel or just one or two. Similarly, although 
all data words were shown to be of the same size, it will be understood that in fact also data 
words of mutually difiTerent size may be placed on the data lines of bus 14 in parallel in the 
same bus cycle: one half size (n/2 bit) data word and two quarter size (n/4 bit) data words for 
example or one half size (n/2 bit) data word and one quarter size (n/4 bit) data word 

In an embodiment the apparatus supplies read data in a bus cycle wherein a 
write address but no write data is supplied As described, less than full size write data words 
for different addresses are placed on the data lines of bus 14 concurrently in one bus cycle, 
and the addresses for these combined data words are placed on the address lines sequentially 
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in a plurality of bus cycles. Thus the data lines of the bus are occiqiied in a bus cycle that is in 
a specific relation to the bus cycle in which one of the address^ is supplied, but the data lines 
of the bus axe fiee in at least one bus cycle that is in the same specific relation to the other 
bus cycle or cycles in which addresses are supplied. In this embodiment read data (siqiplied 
S in response to an earlier read address) is supplied via the data line in this olher bus cycle or 
tiiese other bus cycles. This is illustrate in table la 



Table la reuse of bus cycles for read data 



Bus Cycle No 


Address 


Data 


1 


Al 


DalDa2DdlDd2 


2 


A4 




3 


Al' 


Dal'Dbl'Dcl'Ddl' 


4 


A2' 


Rl 


5 


A3' 




6 


A4' 


R2 


7 


A2" 


DbrDb2"Db3"Db4" 


8 







10 

In this exan^le the bus cycles numbered 4 and 6 are used to supply read data 
words Rl and R2 on the data lines of the bus. 

Figure 2 shows an example of a data part of bus interface 12 that may be used 
to place data words from different i)rocessors lOa-d on the data lines of bus 14 in parallel. 

15 The inter&ce contains a control circuit 20 and a first, second and third multiplexer 22, 23, 24. 
Multiplexers 22, 23, 24 are controlled by control circuit 20, which receives control signals 
fix>m processors lOa-d and outputs a control signal on bus 14. First multiplexer has inputs 
coupled to four groins of data lines Da, Db, Dc, Dd from processors lOa-d respectively. Each 
group of data lines contains four sub-groups (eadi shown as a single line, although it should 

20 be understood that each sub-groiqi contains a plurality of lines in parallel (e.g. 8 or 16 lines). 
The lines from the first two subgroups of each of three of the groups Da, Db, Dc are coupled 
to second multiplexer 23. Third multiplexer 24 has first inputs coupled to a groiqp of data 
lines 25a output by first multiplexer 22, second input coupled to a second group of data lines 
2Sb, which includes two sub groiq>s output by second multiplexer 23 and two of the sub- 

25 groups from tiie first irput 2Sa. Furthermore third mult^lexer 24 has a thurd icput coupled to 
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third group of data lines that are coupled to the first subgroups of each of the four groups of 
iaput data lines Da, Db, Dc, Dd Third noiultiplexer 24 has a group of ou^ut lines 26 that are 
coupled to the data lines of data bus 14. 

In operation first multiplexer 22 coiqiles the group of data bitUnes Da, Db, Dc, 
S Dd fix>m a selected one of processors lOa-d to its ou^ut at bit lin^ 2Sa* Second multiplexer 

23 couples two sub-groi^s of data bit lines j&om a selected one of Ifaree of the processors 
lOa-c to two of the sub-groups of second input 2Sb of third multiplexer 24. Third multiplexer 

24 coupled the data bit lines from a selected one of its inputs 25a-c to its oulputs 26. Control 
circuit 20 controls selection by multiplexers 22, 23, 24 dependent on control signals firom 

10 processors lOa-d. ' 

When the control signals firom processors lOa-d indicate that a fhll size (urbit) 
data word is supplied on one of the groups of data lines Da, Db, Dc, Dd of processors lOa-d, 
control circuit 20 signals first multiplexer to pass the signals fiiom tbat gjcoup of data lines to 
output lines 25a. In diis case control circuit 20 signals third multiplexer 24 to pass the signals 

1 5 fiom the oulput 2Sa of first multiplexer 22 to ou^ut lines 26. 

When the control signals fi'om processors lOa-d indicate that four quarts sized 
(n/4 bits) data words are siqipliedby each of the groups of data lines Da, Db, Dc, Dd of 
processors lOa-d, control circuit 20 signals third multiplexer 24 to pass the signals finom the 
third input 2Sc, which is coupled to sub-groups that contain one quarter word of each group 

20 of data lines Da, Db, Dc, Dd firom all processors. 

When the control signals firom processors lOa-d indicate that two half size 
(n/2-bit) data words are suppUed on two of the groups of data lines Da, Db, Dc, Dd of 
processors lOa-d, control circuit 20 signals first multiplexer to pass the signals firom one of 
those groups (firom fourth group lOd if that is one of the two groups). Furthermore control 

25 circuit 20 signals second multiplexer 23 to pass signals fircm the sub-groups of data lines that 
contain the other half word In this case control circuit 20 signals third multiplexer 24 to pass 
the signals firom its second input 25b that combines the half sized words firom the outputs of 
first and second multiplexer 22, 23. 

Figure 3 shows the part of bus inter&ce 12 that passes addresses fix)m 

30 processors lOa-d. The interface contains registers 30a-d for storing the addresses output by 
processors lOa-d and an address multiplexer 32 to pass selected ones of the addresses firom 
registers lOa-d to the address lines of data bus 14. When the control signals firom processors 
lOa-d indicate that a fiiU size (n-bit) data word is supplied on one of the groups of data lines 
Da, Db, Dc, Dd of processors lOa-d, control circuit 20 signals address multiplexer to pass the 
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address Al, A2, A3. A4 from &at processor lOa^ When the control signals ftomprocessors 
lOa-d indicate that two half size (n/2-bit) data words are si5)pUed on two of the groiqw of 
data lines Da, Db, Dc, Dd of processors lOa-d, control circuit 20 signals address multiplexer 
to pass two addresses from Al. A2. A3, A4 from those processors lOa-d successively, in 
successive bus cycles. When the control signals from processors lOa-d indicate that fi)ur 
quarter size (n/4-bit) data words are suppUed on four of flie gro,q)s of data lines Da, Db, Dc, 
Dd of processors lOa-d. control circuit 20 signals address multiplexer to pass four addresses 
Al, A2. A3, A4 from those processors lOa-d successively, in successive bus cycles. 

Conliol cireuit 20 outputs a code to bus 14 to signal to memories 16a,b which 

form of packing is used. 

In response to data from flie bus the memories unpack the data from bus 14 
according to this code. Preferably, Ihe sequence wherein the addresses are passed 
corresponds to positions at which the corresponding data words are placed on the data lines, 
so that memories automatically know for which data which address is appUcable. In a forther 
embodimait control circuit ou^uts signals to bus 1 4 to indicate in each bus cycle for which 
part of the data lines the address in that bus cycle is appUcable. 

In the embodiments described thus fer no special selection was made as to 
from whichprocessor lOa^ a data word was passed to which of the data lines of data bus 14. 
According to one aspect of the invention control drcnit 20 makes a selection to determine 
from v^ch processor the data word is passed to which of the data lines . Control circuit 
guides this selection so as to minimize the number of data lines of bus 1 4 that will have to 
undergo a logic level changp. This saves energy, since flie data Unes of bus 14 are generaUy 
very long, so that it requites a considerable amount of energy to realize a logic level change. 
Table n illustrates possible placements of two half size (n/2 bit words) W 1 , W2 



Table n 



first bit lines 


second bit lines 


Wl 


W2 


W2 


Wl 



If in a previous bus cycle the first bit lines carried bits VI and in the second bit lines carried 
bits V2, the number of logic level changes for the first placement is 



h(Wl,Vl)+h(W2,V2) 
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(the function "h" denoting Ibe Hamming distance) and for the second placement 
h(W2,Vl) + h(Wl,V2) 

Control circuit 20 determines which placement yields least logic level 
changes, controls the data connections so that the words Wl, W2 are placed accoitiing to the 
placement with a minimum number of changes. The addresses of the words Wl, W2 are 
supplied accordingly. 

As will be appreciated die selection of the position of the data words involves 
a determination of a "better" placement of words, which involves less logic level changes on 
the data lines of bus 14. Such a determination can be performed in various ways. 

IdeaUy, the best possiTjle placement is selected, but a gain is already realized if 
the selection is such that averaged over many different bus cycles the number of logic level 
changes is reduced. The best possible placement can be selected for example by generating 
all possible placements successively internally in the bus interface, determining the hamming 
distance between each of the placements and the data currently on the bus, reverting to the 
placement that has least distance and only then oulputting that placement to the bus. In case 
of two half words two (2!) placements are possible; in case of four quarter words twenty four 
20 (4!) placements are possible. 

It may be noted that it is not necessary to test aU possible placements to find 
the placement that involves least logic level changes. For example, if a first and second 
placement, which differ only by the exchange oftwo particular data words, have been 
compared with the data on the data lines of bus 14. and it has been found that the first 
25 placem^ is better than the second plscenssnt, then there is no furths" need io consider 
further placements that have the particular data words in the same location as second 
placement 

If the bus inter&ce is not capable of generating all possible placements (as in 
the case of the internee of figure 2) aU placements may be tried that the interfece can 
30 produce. Thus, alfhou^ the placement that will be found is possibly not the best possible 
placement, the placements that are found will on average involve fewer logic level changes 
than any fixed selection of placement that is permitted by the interface. 

Figure 4 shows a part of a data interface wherein control circuit 20 is arranged 
to select aplacement so as to reduce the number of logic level changes. In this embodiment 
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inputs 40a-d supply infonnation from the processors (not shown). Control circuit 20 receives 
the information, and uses it to compute the location at the output bus 14 \^ere information 
from respective processors should be output Subsequently, control circuit 20 controls 
multiplexers 42, 44, 46, 48 to output the data according to the computed locations. 
Subsequently control circuit 20 outputs the addresses of the data from the diflferent 
processors successively. Thus it is not necessary to use multiplexers 42, 44, 46, 48 in a search 
for an optimal placement. The data from the multiplexers 42, 44, 46, 48 may be stored in 
registers before passing to the bus, so that control circuit 20 can select a next placement while 
the previous data is ou^ut to the bus. 

Control circuit 20 may compute Ihe optimal placement for example by 
sequentially conoqputing ttie number of level dianges involved m a number of possible 
placement and selecting an optimal one. As an alternative to sequential comparison of all 
possible placements the circuit may also use a parallel form of coniparison. This comparison, 
too, can be such that it is ensured that the best possible placement will be found, or only 
partial in the sense that the placements that are found tend to be better than others. 

As a s^arate aspect, in the embodiment of figure 4 respective hspots AOard are 
organized so that each of the processors (not shown) supplies mformation to all inputs, 
groups of bits of different significance level being suppKed to different inputs. That is inputs 
40a-d are not organized so that each processor (not shown) supplies information to a 
respective one of the iopuls 40ard. At a first inpxxt 40a each of the processors supplies the 
smallest sub-words (e.g. 8 or 16 bits wide, each single line denoting lines for 8 or 16 bits in 
this case) of the infomiation fix>m that may be placed on the bus. A second input 40b each of 
the processors may supply the additional bits contained in the next larger sub-word (e.g. the 
next 8 bits that form a 1 6 bit sub -word with the information from first mput 40a, or the next 
16 bits that form a 32 bit sub-word with the infonnation from first input 40a). At third and 
fourth inputs 40c,d the processors may supply fiirfher parts of the words. 

Control circuit 20 controls multiplexers 42, 44, 46, 48 d^endent on the 
required word size. If a fiill word size is required control circuit causes each multiplexer 42, 
44, 46, 48 to pass data from the same processor, from respective inputs 40a-d. If two words 
of half word size are required, control circuit 20 causes a first and second multiplexer 42, 44 
to pass data fcom first and second input 40a,b, both fix)m a first selected processor, and 
control circuit 20 causes a third and fourth multiplexer 46, 48 to pass data fcom the first and 
second iiq)ut 40a,b both firom a second selected processor. If four words of quarter size are 
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required, control circuit 20 causes each multiplexer 42, 44, 46, 48 to pass data from a 
respective one of the inputs 40a-d from respective selected processors. 

It will be appreciated that the embodiments shown in the figures merely serve 
as examples and do not limit the scope of the invention. Many alternative embodiments are 
possible. For example, one or more of the processors may be replaced by other types of data 
handling imits, such as I/O interfaces, dedicated processing hardware (e.g. signal encoding or 
decoding hardware etc.) or memories. Although the memories 16a,b are shown coiq)led to the 
bus only as data receiving units, it will be understood that memories 16a,b may also be 
connected for writing data to the bus. In this case they noiay be coupled as shown for 
processors lOa-d. 

Similarly, the invention is of course not limited to the use of four data 
handling units, nor does tihe number of data handling units necessarily coincide with the 
ma x im um nunoiber of sub-words that can be placed on the data bus in parallel. A greater 
number of data handling units may be present^ in which case control circuit 20 may arfaitcate 
access to the bus so that no data is placed on the bus from one or more of the processors, 
even when sub-words from a maximimi number of processors is placed on the bus in parallel. 

La a furllier embodiment, control circuit may be adapted to arbitrate wliich of 
the data handling units get access to the bus dependent on the word size with which these 
data handling units supply data. E.g. if four data handling units supply quarter words and two 
supply half words, then control circuit jneferably grants access to the four data handling units 
with quarter words together or to the two data handling units with half words together, in 
order to promote packing. That is, during arbitration control circuit 20 piefeiably selects a 
word size, from the word sizes of data handling units that want to supply words to the data 
bus and grants access to the data handling units that supply words of the selected size. 

In another cmbcdiment, the circuit is arianged to combine words of different 
sizes on the data lines of the bus, e.g. two words of 16 bits and one word of 32 bits. It will be 
appreciated that this requires a more complicated multiplexing structure. However, in this 
case there is no need to select a word size as part of arbitration between different data 
handling units. 

Although preferably each data handling unit (e.g. processor lOa-d, or memory) 
coupled to the bus has the capability of outputting data words of different sizes, it will be 
understood that without deviating from the invention part or all of the data handling units 
may be able to output only data words of their own particular predetermined size, or of a 
predetermined subset of the sizes that can be handled by bus 14. 
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When all data handling units are capable of outputting only data words of their 
own particular predetermined size, control circuit 20 selects packing according to the subset 
of data hfltidling units that output data together (or more particularly according to the specific 
word sizes of those data handling units). For example, if four data handling units that output 
5 16 bit words are active together control circuit 20 may select packing of four 1 6 bit words, 
and if two other data handling units output 32 bit words control circuit 20 may select jmcking 
of two 32 bit words. Sinularly» not all memories 16a,b need to be able to handle all word 
sizes. In some embodiments words of diflFerent sizes may be written to different memories 
only, or some memories may accept words of multiple sizes whereas others accept words of 
10 only one size, only words of the latter size being to those memories. 

Figure Sa shows an example of an embodiment of memory 1 6a,b that is 
capable of handling only one word size. The memory contains a multiplexer 50, a memory 
circuit 52 and a word selection circuit 54. The data lines fix)m the bus are coupled to the 
inputs of multiplexer 50 (four lines shown, each representing for exanople 8 or 16 data lines). 
15 An output of multiplexer 50 is coupled to a data input of mraiory circuit 52. The address 

lines fix)m the bus are coupled directly (without multiplexing) to an address input of memory 
circuit 52. Word selection circuit 54 controls multiplexer 50. 

In operation word selection circuit 54 receives a signal indicating that four 
successive addresses will be si^plied for data that is supplied in parallel on the data lines. In 
20 this case, word selection circuit 54 causes multiplexer 50 to pass different sub-words ftom 
the data lines to the data input successively, so that these sub-words are written into memory 
circuit 52 under influCTice of successively supplied addresses. A predetermined sequence of 
sub-word selections may be used for example, through which selection circuit 54 cycles 
under influence of a bus clock. But in a different embodiment selection signals may be 
25 supplied from the circuit that packs the words. 

Figure 5b shows an example of an embodiment of memory 1 6a,b that is 
capable of handling more than one word size, hi this embodiment a memory circuit 56 is 
used that has four (e.g. 8 or 1 6 bit) groups of data inputs Da, Db, Dc, Dd, the first group Da 
serving for receiving quarter word data, the first and second group Da, Db together serving 
30 for receiving half word data and all groups Da, Db, Dc, Dd serving to receive full word data. 
A width selection input W of memory circuit 56 is used to select whether quarter, half or full 
words will be written. 

Two multiplexers 5 la,b are provided between the data lines of the bus and the 
first and second groups Da, and Db respectively. Word selection circuit 54 controls 
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multiplexers 51a,b. Two groups of data lines from the bus (single lines shown to symbolize 
the groitps) are coupled to the third and fourth inputs Do, Dd respectively. 

In operation word selection circuit 54 receives an indication whedier quarter, 
half or fiill words are supplied on the bus and signals this to memory circuit 56 accordingly. 
5 In case of quarter words memory circuit 56 reads only from its first gioup of inputs Da and 
word selection circuit 54 controls the multiplexer 51a coupled to this group of inputs to -pass 
data from different groups of data lines of the bus succ^sively. In case of half words 
memory circuit 56 reads only from its first and second groitp of iiqiuts Da, Db and word 
selection circuit 54 controls the multiplexers 51a, 51b coiipled to these group of inputs to 
10 pass data from different pairs of groups of data lines of the bus successively (e.g. first from 
two predetermined groups and subsequmtly from the remaining groups). In case of full 
words memoiy circuit 56 reads all its group of inputs Da, Db, Dc, Dd and word selection 
circuit 54 controls the multiplexers 51a, 51b to pass data from different pairs of groups of 
data lines of the bus to respective ones of the groups of inputs. 
15 It will be understood that, instead of memories, other data receiving circuits 

may be coupled to the bus to receive the packed data. It will be understood that the data 
receiving circuit may have a register (not shown) coiq)led between the bus and multiplexer 50 
or 51a,b. In this case bus interface 12 may be arranged to drive the data lines of the bus only 
until the packed data has been stored in the register, while using a plurality of addresses to 
20 supply addresses for the data that is supplied in that cycle. Thus, power consumption for 
driving the bus is reduced. 

It will be understood that inter&ce 12 may also be realized by means of 
respective multiplexer/bus drivers associated with respective ones of the data handling units. 

Figure 6 shows an embodiment in which data handling units 1 Oa,d (only two 
25 shown, an arbitraiy number simiLtrly connected units may be present) have data outputs 

coupled to data lines of bus 14 via multiplexer/bus drivers 60a,b and address outputs coupled 
to address lines of bus 14 via address drivers 62a,b (single bus lines are shown to symbolize 
groups of conductors, e.g. groups of 8 or 16 data conductors and a group of 32 address 
conductors). Data handling units 10a,d have control connections coupled to control circuit 
30 20. Control circuit 20 has control connections coupled to multiplexer/bus drivers 60a,b and 
address drivers 62a,b of the various data handling units 10a,d. 

Figure 6a shows an exanq)le of an embodiment of a multiplexer/bus driver, hi 
this figure, groups of data lines 600a-d from the data handling units (not shown) are shown as 
single lines. Similarly groiq)S of data lines 602a-d coupled to the bus (not shown) are shown 
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as single lines. The multiplexer/bus driver contains four groups of tri slate drivers 604 (Ihe 
output states being logic high, logic low and high impedance) coupling a first groi^ of data 
lines 600a from a data handling unit 1o a first, secon4 third and fourth groiq, of data lines 
602a-d from the bus respectively. The multiplexer/bus driver contains two groups of tri state 
5 drivers 604 coiqiling a second group of data lines 600b from a data handling unit to the 

second and fourfli group of data lines 602b,d from the bus respectively. The multiplexer^us 
driver contains two groiq)s of tri state drivers 604 coupling a third and fourth group of data 
lines600c.dfiomadatahandlingunittothethirdandfourthgioup of datalines 602c,dfiom 
the bus respectively. TTie control circuit controls which, if any. of the grotqjs of tri state 
1 0 drivers 604 are active, so that a quarter data word from the first group of data lines 600a from 
the data handling unit can be passed to any selected one of the groups of data lines 602a^ 
from thebus. ahalf data wordfiom the first and second group of data lines 600a.b from the 
data handling unit can be passed to the first and second groins of data lines 602a,b from the 
bus or to the third and fourth groins of data lines 602a,b from the bus and a foil size data 
15 word from all of the groups of data lines 600a^ from the data handling units can be passed to 
re^ective groups of data lines 602a-d ficom the bus. 

In operation control circuit 20 receives access requests fixm data handling 
units 10a,d to writing write data and accompaigdng addresses onto bus 14. Control drcuit20 
selects a data handling unit to put a single data word on bus 1 4 or a combination of data 
20 handlingunitstoputacombinationoflessthanfimsizedalawordsonbusM.Conttolcircui^ 

20 signals to multiplexecljus drivers 60a.b which of multiplexet/bus drivers 60a.b should 
drive data from the associated date handling unit 10a,d to data toies of bus 14, and, in case of 
a word of less than fiill size, to which of the groups of date lines of the bus. In case of a 
combination of data, control circuit 20 signals mullq)lexer/bus drivers 60a.b to do so 
25 concmrendy for respective different groups of date lines of bus 14. so fliat words of less than 
fiiU size from different processors are combined on bus 14 in this case. 

Control circuit 20 also signals address drivers 62a.b to drive address 
information from associated data handling units 10a.d to address lines of bus 14. Control 
circuit signals multiplexer/bus drivers 60a,b to do so sequentiaUy with address infimnation 
30 for date words that are put on the bus together. 

In this embodiment the selection of positions of words on the date bus is not 
optimized to minimize the number of level changes. If optimization is desired, date from at 
least part of date handling units 1 Oa,d and date from the bus lines are also suppUed to control 
dicuit. and die control circuit is arranged to select positions in a combined placement of date 
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to reduce the number of level changes. Control circuit 20 then effects the selected 
combination by supplying coiresponding control signals to multiplexer/bus drivers 60a,b. 

Moreover, memory 16a may be cotipled to control circuit 20 to request access 
to the data lines of bus in order to return read data. Control circuit 20 preferably grants 
5 memory 16a access to the data lines for this purpose in bus cycles in which the data lines 
remain free from write data because the corresponding address appUes to a less than fidi size 
data word that was supplied together with other less than full size data words. 

In one embodiment read requests from data handling units 10a,b to memory 
16a are each accompanied on bus 14 by an identification of the data handling unit 10a,b or 
10 the read request, and memory 1 6a returns the identification with the read data on bus 14, the 
requesting data handling unit being arranged to compare the identification with the 
identification fiom its own read requests and to capture the read data if the identification 
matches. Alternatively control circuit 20 maintains a queue of identifications of data handlmg 
units 1 0a,b that issued as yet unanswered read requests, and, xtpon aUowing read data to be 
15 placed on the bus, signals to the data handling unit 10a,b that issued the oldest unanswered 
read request for the memory unit from which read data is suppKed, tiiat this data handlmg 
unit should read the read data from bus 14. 

Although the invention has so &r been desoribed for packing write data, i.e. 
data that is suppUed in a write request in conjunction with an address, the invention can also 
20 be appUed to read data (supplies in response to an address m a read request) and/or to 

combmations of read data and write data. Of course, read data words of less than fiiU bus size 
can be packed with write data and/or other read data just as aoy data. Ihe difEerence is that 

tiie read address has to be siqq)Ued to tiie bus in advance and need not be supplied with the 
read data. 

25 Figure 7 shows a fiirther smbodimeat in which memories I6a,b have control 

and data ou^ coiq»led to mterfece 12 to mdicate tiie availabiUty of read data, hiterfece 12 
may have any of the structures disclosed m the preceding, except tiiat the control ou^uts of 
manories 16a,b are coiq)led to control circuit 20 (not shown separately) and the data ou^uts 
of memories 16a,b are coiq)led to inputs of die multiplexers (not shown separately). In 
interface 12 the outputs of the memory are coupled to tiie contool circuit (not shown 
separately) which selects a combination of words of less than fiill size that wiU be placed on 
the bus. Subsequently, the control circuit causes the selected placement to be effected on the 
data lines of the bus. As fer as write data is involved, the control circuit causes the associated 
address to be placed on the address Imes of the bus sequentially, for example in a sequence 
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accoiding to tide order in which the corresponding data is placed on die data lines. Id time 
slots that correspond to read data no address need be placed on the bus, but of course flie 
address lines may be used for issuing another read request 

Table HI shows an exan^le in \i^ch words of write data WD and words of 
5 read data RD are placed on bus 14. As can be observed in bus cycle 3, two quarter words of 
write data WDal', WDcl'and two quarter words of read data RDbl', RDdl' are placed on the 
data lines of bus 14 together. In bus cycles 3 and S the addresses for the write data are 
supplied. 

10 Table m further example of bus occiqpation 



BusQycleNo 


Address 


Data 


1 


Al 


WDal WDa2 WDdl WDd2 


2 


A4 




3 


Al' 


WDal' RDbl' WDcl' RDdl' 


4 


X 




5 


A3' 




6 


X 




7 


A2" 


WDbl" WDb2" WDb3" WDb4" 


8 







It may be noted that no read addresses need be supplied for the read data 
(indicated by X in the bus cycles numbered 4 and 6), since the read address is supplied in 

15 advance, before reading the data. The control circuit may be arranged to indicate to the data 
handling units for which of the data handling unit Ihe read data word is intended, for example 
on the basis of information about an oldest read request from a queue of as yet unanswered 
read requests form respective m^nories, in which queue the data handling unit that has 
issued (he request is recorded. 

20 In an embodiment tihe bus (^les without address may be used to supply read 

addresses for new read requests. In another embodiment these bus cycles are omitted, e.g. in 
table in bus cycles 4 and 6 would be omitted so that there is only one intermediate bus cycle 
between cycles 3 and 6 in which Ae data words are placed on the bus. M this embodiment the 
control circuit 20 is preferably arranged to provide a signal to memories 1 6a,b for each 

25 address, to indicate to which of the less than fiiU size data words in a packed data word the 
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address applies. In the case that read results are included in the packed data word, the control 
circuit 20 optionally is arranged to provide a signal to processors lOa-d to indicate which of 
the less than ftill size data words in a packed word are the result of read requests, so that the 
processors are enabled to select the read results, but alternatively, of course the circuit may 
be provided with multiplexers that enable control circuit 20 to ensure that the read results 
from the packed words are fed to those processors that issued the read requests that resulted 
in the read results. 

It should be noted that the above-mentioned emhndimentf? illustrate rather than 
limit the invention, and that those skilled in the art will be able to design many alternative 
embodiments without departing from the scope of the fended claims. In the claims, any 
reference sign placed between parentheses shall not be construed as limiting the claim. The 
word "con^rising" does not exclude the presence of elements or steps other than those Hsted 
in a claim. The word "a" or "an" preceding an element does not exclude the jiresmce of a 
plurality of such elements, hi the device claim enumerating several means, several of these 
means can be embodied by one and the same item of hardware. The mere fact that certain 
measures are recited in mutually different dependent claims does not indicate that a 
combination of these measures cannot be used to advantage. 
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CLAIMS: 



1 . An electronic data piocessing ciicuit^ the circuit conqnising 

a plurality of data handling units (lOa-d, 16a-b) with data outputs, at least part 
of the data handling units having address outputs; 

a hus (14) with address lines and data lines, the data lines supporting 
5 simultaneous transfer of up to a maximum numher of bits in a bus cycle; 

a bus controller (20) coupled to the data handling units and arranged to control 
access to the bus in successive access cycles, the bus controller (20) being arranged to cause 
data bits from a plurality of data words of less than said maximum number of bits, from 
respective ones of the data handling units (lOa-d, 16a-b), to be placed in combination on flie 
10 data lines in a same bus cycle, the bus controller (20) cavising write addresses that the 

respective ones of the data handling units (lOa-d, 16a-b), supply for respective ones of the 
plurality of data words to be placed on the address lines in a plurality of respective bus 
cycles. 

15 2. An electronic data processing circuit according to Claim 1 , wherein the data 

handling units (lOa-d) support a variable word size, the bus controller (20) adapting a number 
of words in the plurality of data words that is placed on the data lines to the word size or 
word sizses supplied by the data handling units (lOa-d, 16a-b). 

20 3, An electronic data processing circuit according to Claim 1 , wherein the bus 

controll©: (20) is arranged to select a distribution of which bits of the plurality of words will 
be placed on vMch of the data lines in said same cycle, dqpendent on an evaluation that takes 
account of a number of data lines that will change lo^c level upon placmg the bits of the 
plurality of words on the data lines, the bus controller (20) selecting a distribution that 

25 minimizes the number of data lines that will change logic level among at least two possible 
distcibutions. 
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4. An electronic data processing circuit according to Claim 3, wherein the hus 
controller (20) selects the distributions from permutations of placements of the plurality of 
words on the data lines. 

5. An electronic data processing circuit according to Claim 4, wherein the bus 
controller (20) causes the addresses that the respective ones of the data handling units supply 
for respective ones of the plurality of data words to be placed on the address lines in 
successive bus (^cles, in a sequence wherein a position of the address is dependent on the 
position where the corresponding word is placed on ffae data lines. 

6. An electronic data processing circuit according to Claim 1 , wherein at least 
one of the data handling units (16a-b) is a memory unit, the bus controller being arranged to 
include^ among said plurality of data words that is placed in combination on the data lines in 
a same bus cycle, a read result produced by said memory unit (1 6a,b) when available. 

7. An electronic data processing circuit according to Claim 6, wherein the control 
circuit (20) is arranged to adjust a number of bus cycles in the plurality of respective bus 
cycles in whidh addresses are placed on the address lines for respective ones of the plurality 
of data words, said number being adjusted dependent on a further number of read results that 
is included in the plurality of data words. 

8. An electronic data processing circuit according to Claim 1, comprising a data 
receiving circuit (16a,b) coupled to the bus (14), the data receiving circuit (16a,b) being 
arranged to handle multi bus cycle data transfers via the bus (14) using a single start address 
supplied via the address lines, the bos cunixoiler (20) being arranged to place the addresses 
that the respective ones of the data handling units (lOa-d) supply for respective ones of the 
plurality of data words on the address lines in bus cycles corresponding to a multi bus cycle 
data transfer wherein the multi bus cycle data t ranRfer involves no address transfer. 

9. An electronic data processing circuit, the circuit comprising 

a plurality of data handling units (1 Oa-d, 1 6a,b) with data outputs, at least part 
of the data handling units (10a-<Q having and address outputs for writing words of data; 
a bus (14) with address lines and data lines; 
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a bus cqntroller (20) coupled to tiie data handling units and arranged to control 
access to Ifae bus (14) in successive access cycles, the bus controller being arranged to select 
a distdbution of when and/or j/Aterc words fiom the data handling units will be placed on Ihe 
data lin^ of the bus (14), dependent on an evaluation that takes account of a number of data 
S lines that will change logic level upon placing the words on the data lines, the bus controller 
(20) selecting a distribution Ifaat minimizes the number of data lines that will dbange logic 
level among at least two possible distributions. 

10. An electronic data processing circuit according to Claim 9, wherein the bus 
10 controller (20) is arranged to select the distribution of a plurality of words that are placed on 

Ifae data lines inparalleL 

11. An electronic data jjrocessing circuit according to Claim 9, wherein the bus 
controller (20) is arranged to select a sequence of a plurality of words that are placed on the 

IS data lines or a subset of the data lines in series. 

12. A method of processing data, the method comprising 

supplying a plurality of data words with variable word size and addresses for 
those data words; 

20 - placing data bits from the plurality of data words of less than the maximum 

word size on data lines of a bus (14) in a same bus cycle if more than one of the plurality of 
data words has a word size of less than the maximum word sl2c; 

placing the addresses for respective ones of the plurality of data words on 
address lines of the bus (14) in a plurality of respective bus cycles. 

25 

13 . A method of processing data, the method comprising 

supplying respective data words for output on data lines of a bus (14); 

selecting distribution of the data words over the data lines and/or over a 
teooporal sequence in which the data words will be placed on the bus (14), taking account of a 
30 number of data lines whose logic level will change upon placing the words on the data lines, 
the distribution being selected to minimize the number of data lines that will change logic 
level among at least two possible distributions; 

placing the data words on the data lines of the bus (14) according to the 
selected distribution. 
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ABSTRACT: 



An electronic data processing dicait contains a plurality of data handling units 
(lOa^ 16a-b) with data ou^mts, at least part of the data handling units having address 
ou^uts. The data handling units si^ly words of preferably selectable size to a bus. A bus 
controller (20) is arranged to control access to the bus in successive access cycles. The bus 

5 controller (20) causes data bits fiom a phiraUly of data words fiom respective ones of the data 
handling units (lOa-d, 16a-b), to be placed in combination on the data lines in a same bus 
cycle. The bus contioUer causes write addresses that the respective ones of the data handling 
units (lOa-d, 16a-b) supply for respective ones of the plurality of data words to be placed on 
the address lines in a pluraUty of respective bus cycles. Preferably, the temporal or spatial 

10 arrangement of the data words on the bus Unes is adqrted so as to mimmize the number of 
logic level changes on the bus. 
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